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Abstract. In terms of the concepts of state and state transition, a new heuris- 
tic random search algorithm named state transition algorithm is proposed. For 
continuous function optimization problems, four special transformation opera- 
l | tors called rotation, translation, expansion and axesion are designed. Adjusting 

measures of the transformations are mainly studied to keep the balance of ex- 

Oploration and exploitation. Convergence analysis is also discussed about the 
algorithm based on random search theory. In the meanwhile, to strengthen 
the search ability in high dimensional space, communication strategy is in- 
troduced into the basic algorithm and intermittent exchange is presented to 
prevent premature convergence. Finally, experiments are carried out for the 
j*H algorithms. With 10 common benchmark unconstrained continuous functions 

■ i used to test the performance, the results show that state transition algorithms 

are promising algorithms due to their good global search capability and con- 
£f) vergence property when compared with some popular algorithms. 
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^j- 1. Introduction. The concept of state means to a situation which a material sys- 

tern maintains, and it is characterized by a group of physical qualities. The process 
s ^ of a system turning from a state to another is called state transition, which can be 

X/*} described by a state transition matrix. The idea of state transition was created by a 

Russian mathematician named Markov when he expected to represent a specific sto- 
O^l chastic process (known as Markov process) [22]. Not only in communication theory 

but also in modern control theory, state transition matrix is of great importance. 
^ For instance, in modern control theory, it can determine the stability of a system. 

K^j In almost all branches of engineering, including system design, tactical planning, 

^ system analysis, process management and control, and model parameter adjust- 

ed ment, optimization techniques have found wide applications [30]. Generally speak- 

ing, the methods used for solving such optimization problems can be classified into 
two categories: deterministic and stochastic, in which, stochastic methods are sub- 
divided into evolutionary algorithms and metaheuristic algorithms. The traditional 
deterministic algorithms include Hooke- Jeeves pattern search [11] and hill-climbing, 
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evolutionary algorithms contain genetic algorithm (GA)[8, 7], evolutionary pro- 
gramming, evolution strategies, and genetic programming, while metaheuristic al- 
gorithms consist of simulated annealing, particle swarm optimization (PS0)[12, 25], 
differential evolution(DE) algorithm [24], etc. On the other hand, the commonly 
used numerical algorithms for engineering optimization problems can also be cate- 
gorized into two classes: direct search methods and gradient-based methods. The 
direct search methods comprise the simplex search, Powell's conjugate direction 
method and random search, while the gradient-based methods include the New- 
ton's (basic-, modified-, quasi-) and conjugate gradient methods[15]. In the same 
time, hybrid methods, combining of deterministic and stochastic or direct search and 
gradient-based, are also proposed to draw on each other's strengths[2, 29, 16, 32]. 

According to the No Free Lunch Theorem [28], no search algorithm is better than 
other algorithms on the space of all possible problems. This paper introduces a 
new method for optimization of continuous nonlinear functions, which belongs to 
metaheuristic random search. Because of its foundation on state and state transi- 
tion, the method is called state transition algorithm (STA)[33, 34]. The algorithm 
has roots in three main component methodologies. One is the random optimization 
theory, the others are population-based approach and then space transformation 
method. In this paper, it focuses on four operators named rotation, translation, 
expansion and axesion transformation as well as the communication strategy in 
state transition algorithm. Compared with some state-of-the-art optimization al- 
gorithms, RCGA[26], CLPSO[14], and SaDE[20], which are improved versions of 
GA, PSO and DE, the experimental results show that STAs are comparable and 
promising algorithms. 

2. The basic state transition algorithm. Considering the following uncon- 
strained optimization problem 



In a deterministic view, it usually adopts iterative method to solve the problem 



where, Xk is the A;th iteration point, is the A;th step size and dk is the A;th search 
direction. 

The common selection of a step is by exact line search or inexact line search. 
While the techniques of search direction include steepest descent method, conju- 
gate gradient methods, Newton's methods, alternating directions, and conjugate 
direction methods[31]. 

In a way, the iterative methods aim to search for a direction and a step in an itera- 
tion. Though these methods utilize the gradient information explicitly or implicitly, 
they have their inherent defects. For one thing, it is computationally difficult. For 
another, it only indicates the local information. In the view of global optimization, 
the direction of gradient is just a way standing for direction, and it has no substan- 
tial effect on searching for a global optimum. If the iterative method is concerned in 
a state and state transition way, then an iterative point can be regarded as a state, 
the process of searching for a direction and a step will equate to a state transition 
process, and through a state transition, a new state will be created. 

In the point of stochastic, it can also understand the evolutionary algorithms 
and metaheuristic algorithms in a state and state transition way. For example, 
genetic algorithm, its each individual of a generation can be considered as a state, 



min fix). 



(1) 



%k+l = %k + CLkdk, 



(2) 
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and the updating process of using genetic operators such as selection, crossover and 
mutation can equate to state transition processes. In the same way, particle swarm 
optimization, the flock updating its velocity and position, and differential evolution, 
adding the difference vector of two randomly chosen vectors to a target vector, can 
also be regarded as state transition processes. 

In terms of the concept of state and state transition, a solution to a specific 
optimization problem can be described as a state, the operators of optimization 
algorithms can be considered as state transition, and the process to update current 
solution will become a state transition process. 

Through the above analysis and discussion, it defines the following form of state 
transition 

%k+i — AkXk + BkUk / q \ 

2/fc+l = ZO/e+l) 

where, x k stands for a state, corresponding to a solution to the optimization prob- 
lem; then, Ak and B k are state transition matrixes, which can be regarded as 
operators of optimization algorithm; u k is the function of state Xk and historical 
states; while / is the cost function or evaluation function. 

2.1. State transformation operators. As a matter of fact, operators such as 
reflection, contraction, expansion and rotation are widely used in simplex optimiza- 
tion method[19, 3, 13], which is especially popular in the fields of chemistry, chemical 
engineering, and medicine. However, they always fail to lead to continued progress 
and are not applicable to a wide range of functions. 

In the theory of space and transformation, rotation matrices are only defined 
for two and three dimensional transformation. For example, the two dimensional 
rotation matrix is f cos ® ~ si f 1 . 

[ sinv cosB J 

Using various types of space transformation for reference, in this paper, it de- 
fines the following four special state transformation operators to solve continuous 
function optimization problems. 
(1) Rotation transformation 

Xk+i =x k + a 1 R r x k , (4) 
n\\x k \\ 2 

where, x k G 3? n , a is a positive constant, called rotation factor; R r G 3? nxn , is 
random matrix with its entries obeying the uniform distribution in the range of [-1, 
1] and || • || 2 is 2-norm of vector or Euclidean norm. Then, it will prove that the 
rotation transformation has the function of searching in a hypersphere. 



Proof. 



\\Xk+l -Xkh = \\ a II 1 II R rXkh 

n\\x k \\ 2 



a 



n\\x k \\2 
a 

n\\x k \\ 2 



\RrXkh (5) 



<^^\\Rr\\rnJ\Xkh<a 



□ 



(2) Translation transformation 



Xk+i = x k + /3R t 7^ — Xk \ , (6) 

\\X k - X k -l\\2 
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where, f3 is a positive constant, called translation factor; R t G 3? is a random variable 
with its components obeying the uniform distribution in the range of [0,1]. It is 
obvious to find the translation transformation has the function of searching along 
a line from x^-i to Xk at the starting point Xk, with the maximum length of (3. 

(3) Expansion transformation 

Xk+i = x k +jR e Xk, (7) 

where, 7 is a positive constant, called expansion factor; R e G 5R nxn is a random 
diagonal matrix with its elements obeying the Gaussian distribution (in this study, 
standard normal distribution). It is also obvious to find the expansion transforma- 
tion has the function of expanding the components in x k to the range of [-00, +00], 
searching in the whole space. 

(4) Axesion transformation 

Xk+i = x k + SR a x k , (8) 

where, S is a positive constant, called axesion factor; R a G !ft nxn is a random 
diagonal matrix with its entries obeying the Gaussian distribution and only one 
random position having nonzero value. The axesion transformation aims to search 
along the axes and strengthens single dimensional search. 

2.2. State transformation algorithm. Before the state transition algorithm, it 
is necessary to introduce the basic random optimization[17, 1, 9, 10]. Considering 
the above unconstrained optimization problem, the procedure of the basic random 
optimization can be outlined in the following pseudocode. 

1: Initialize feasible solution xo, and set k ^— 
2: repeat 

3: k <- k + 1 

4: Generate a Gaussian random number vector r 

5: Xtrail <~ Xk-l + T 

6: if f(xtraii) < f(xk-i) then 

7 ; Xk i Xt ra n 

8: else 

9: X k <- X k -! 

10: end if 

11: until the specified termination criterion is met 

For one thing, as a metaheuristic random method, the state transition algorithm 
is similar to the basic random optimization [17]. The only difference is that a can- 
didate solution set is generated by the four special operators, while a new trail is 
selected following the same way as that of the basic random optimization, which 
means that the "greedy criterion" is used in selecting the new state. By the way, 
a candidate solution set is created by some times of transformation. The times of 
the transformation or the size of the set is called search enforcement (SE), and the 
translation operator is only performed when a better new trail is found. 

For another, as a stochastic algorithm [5], the dealing with dynamic balance be- 
tween diversification (exploration of the solution space) and intensification (ex- 
ploitation of the accumulated knowledge) is also significant in state transition algo- 
rithm. Due to their intrinsic properties, the rotation is chosen for exploitation, the 
expansion is for exploration, the translation is selected as to maintain equilibrium 
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between them, and axesion is proposed to strength the single dimensional search. 
The main process of STA is shown in the pseudocode as follows 
1: repeat 

if a < a m i n then 

OL i ft m ax 

end if 

Best expansion(funfcn,Best,SE,/3,7) 
Best ^— rotation(funfcn,Best,SE,a,/3) 
Best <— axesion(funfcn,Best,SE,/3,£) 

. ^ fc 

until the specified termination criterion is met 



> expansion transformation 
> rotation transformation 
> axesion transformation 



As for detailed explanations, expansion function in above pseudocode is given as 
follows for example 
1: oldBest <- Best 
2: fBest <— feval(funfcn, oldBest) 
3: State ^— op_expand(Best,SE,7) 
4: [newBest,fGBest] ^— fitness (funfcn, State) 

5: if fGBest < fBest then > greedy criterion 

6: fBest <- fGBest 
7: Best newBest 

8: State <— op_translate(oldBest,Best,SE, / 5) 
9: [newBest, fGBest] ^— fitness (funfcn, State) 

10: if fGBest < fBest then > greedy criterion 

11: fBest <- fGBest 

12: Best ^— newBest 

13: end if 
14: end if 



2.3. Parameters analysis in STA. In state transition algorithm, there are five 
important parameters, namely search enforcement (SE), rotation factor a, transla- 
tion factor /?, expansion factor 7 and axesion factor 5. It is easy to understand that 
the larger the search enforcement, the higher the intensity of search, and vise versa. 
However, the larger search enforcement will cause larger computational complexity. 
In this paper, the search enforcement is recommended to use the same size as the 
dimension of the optimization problem. 

When SE is constant, taking the exploration and exploitation into consideration, 
the strategy of adjusting parameters of the four operators is significant. To make the 
deeper exploitation, the smaller rotation factor is needed. Especially, the rotation 
factor will vary in a declining way from a positive constant till zero to gain a high 
precision solution. In the meanwhile, there are two schemes to regulate the a. One 
is to adjust the parameter in an inner loop, namely, decreasing the rotation factor 
from a start constant to the end in the operation of rotation transformation [33]. 
The other is to adjust the parameters in an outside loop, that is to say, decreasing 
the rotation factor according to the iterations. To balance the global search and 
local search timely, the latter scheme is adopted in the paper; however, the rota- 
tion factor is decreasing itself from a maximum value to a minimum value in an 
exponential way with base /c, which is called lessening coefficient [34] , as described 
in the pseudocode of STA. By the way, extra tests have testified the effectiveness 
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of the scheme. 

As for the remained control parameters, for example, the larger the translation 
factor, the longer STA searches along a straight line. However, the magnitude of 
translation factor has great influence on exploitation and exploration. The large 
translation factor will facilitate the exploration, while the small translation factor 
benefits the exploitation. Similarly, the same phenomenon exists in the selection 
of expansion and axesion factors. Taking the complexity of adjusting strategies for 
these control parameters into consideration, we keep them fixed in current version 
of STA for simplicity. 

2.4. The convergence analysis of STA. The convergence of stochastic opti- 
mization algorithms has been heatedly discussed. For instance, genetic algorithm 
was analyzed by means of homogeneous finite Markov chain[21], particle swarm 
optimization was studied to investigate particle trajectories in a discrete system 
view [4], and convergence property of differential evolution was also discussed in [24]. 

As a metaheuristic random optimization algorithm, the convergence analysis of 
STA will follow the same way as random search methods. In fact, the probability 
of random search algorithm for finding global minimum being equal to 1 was stated 
by Solis and Wets [23]. That is to say, the STA will satisfy the similar convergence 
performance of random optimization algorithm, and readers who are interested in 
convergence analysis are referred to their work for details. 

3. Communication strategy into state transition algorithm. In a way, the 

basic state transition algorithm is individual-based, and an individual searches in 
its neighborhood. The difference between the basic state transition algorithm and 
other random optimizations is that the search space is normalized or specialized. 

The population-based approach is prevalent in metaheuristic algorithms, such 
as genetic algorithm, particle swarm optimization and differential evolution. Let's 
name the basic state transition algorithm STAI, the improved state transition al- 
gorithm based on population is called STAII with the number of states denoted 
as SN. In the meanwhile, some communication strategies are necessary to man- 
age the individuals for sharing information, which is important in population-based 
methods. 

3.1. Crossover operator. Individual communication can be implemented in var- 
ious ways, of which crossover operation is quite common, especially in genetic algo- 
rithms. 

Let Xi and X 2 be individual components of current generation, Y\ and Y 2 are 
the offspring components, some canonical crossover operators are displayed in the 
following. 

(1) Michalewicz's arithmetical crossover[18] 



where, a is either a constant or a variable whose value depends on the age of 
population. 

(2) Wright's linear crossover [2 7] 



Yi = olX x + (1 - a)X 2 
Y 2 = aX 2 + (1 - a)X 1 



(9) 



Y 1 = 1.5Xi - 0.5X 2 ,F 2 = -0.5Xi + 1.5*2, *3 = (*i +* 2 )/2 



(10) 
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(3) Kalyanmoy Deb's simulated binary crossover [6] 

Y x = 0.5[(1 - + (1 + p)X 2 \ , 
r 2 = 0.5[(l + /3)X 1 + (l-/3)X 2 ] ' {lL) 

where, is a random variable, obeying the following probability distribution 

fp(/3) = 0.5(f/ c + 1)/?"= < < 1, 
= 0.5(^ + 1)^ /3>1, 

here, p(-) is the probability density function, r\ c is the distribution index, which 
determine how well spread the children will be inherited from their parents. 

(4) The proposed crossover 

Fi = SX! + (1 - S)X 2 , 

Y 2 = V Xy + (1 - rj)X 2 , ^ 

where, 5 and 77 are independent variables, which obey the 0-1 distribution. 

In this proposed crossover, crossover operation means for each component of a 
pair of individuals, components exchange or maintain their information completely. 

3.2. Intermittent exchange. Different from other population based algorithms, 
all of the individuals in STAII are elites, and they develop themselves trough state 
transformation, which is referred as self learning. When the communication strat- 
egy is introduced, the individual can contact with each other to better develop 
themselves. However, it may bring about some disadvantageous effects. If the fre- 
quency of individual communication is too high, individuals are apt to imitate each 
other utterly, which will cause premature convergence. In this paper, intermittent 
exchange is proposed to solve the issue, that is to say, individual communication 
occurs at a certain frequency, where the frequency is named communication fre- 
quency (CF). 

The communication strategy is adopted to share information among individuals, 
and it is regulated by communication frequency. If CF is small enough, it will 
equate the situation without the communication strategy. When CF is too large, 
individuals are easily trapped into imitating each other, causing the premature con- 
vergence, that is to say, a moderate CF is appropriate. In this paper, we recommend 
to use the same magnitude as square root of the maximum iterations. 

When the exchange condition is satisfied, the proposed crossover operator will 
be performed. Each state will communicate with all of the other states, to make 
sure that useful information is completely shared. 

3.3. The framework of STA with communication strategy. Through the 
above discussion and analysis, by introducing in communication strategy, the pseu- 
docode of the kernel of state transition algorithm can be described as shown in the 
following 

1: repeat 

2: if a < a m { n then 

3: Ol, 4 ^max 

4: end if 

5: State ^— self_learning(funfcn,State,SE,a,/3,7,(5) > self learning 

6: a <- f 

fc 

7: if mod(iter,CF)==0 then > intermittent exchange 

8: State communication(funfcn, State) 
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9: end if 

10: [Best,£Best] <(— fitness (funfcn, State) 

11: until the specified termination criterion is met 

In the process of the algorithm, the self -learning means each state in the state set 
will be performed on four state transformation operators, while in communication 
function, the intermittent exchange is adopted at intervals. The flowchart of the 
algorithm is outlined in Figure 1. 




Initialize a state set 



Current state set 



State 
transformations 



Updated state set 



Intermittent 
exchange 




Figure 1. the flowchart of STAII 



4. Experiments and results. To compare the proposed state transition algo- 
rithm with previously mentioned RCGA, CLPSO and SaDE, two experiments are 
arranged. The first experiment is mainly for two dimensional functions, and the 
other focuses on ten dimensional functions test. In the same time, both STAI and 
STAII are carried out, for comparison with other algorithms as well as themselves. 

4.1. Test functions. In order to test the performance of STA, ten common bench- 
mark functions are selected for the experiment. Seven functions are multidimen- 
sional functions of various modals and the other three are two dimensional functions, 
which are listed in Table 1, while the landscapes of two dimensional functions are 
plotted in Figure 2. 



Rotation 
Translation 
Expansion 

Axesion 
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Table 1. Benchmark functions for test in this paper 



Name of function 


Function definition 


Range 


f 

J min 


Spherical 


h = Z) n i x ? 


[-100,100] 





Rastrigin 


h = - Wcos(2nxi) + 10) 


[-5.12,5.12] 





Griewank 


A = 4KoE?=i*?-nr««i^i+i 


[-600,600] 





Rosenbrock 


h = E?-i(100(* 1+ i - xf) 2 + ( Xi - l) 2 ) 


[-30,30] 





Schewefel 


f5 = J2? =1 [-Xisin(V\x~\)] 


[-500,500] 


-418.9829n 


Ackley 


fe = 20 + e- 20exp(-0.2^ 1 £J* =1 *?) 


[-32,32] 





- eX P(^ E"=l COs(27TXi)) 




Michalewicz 


/7 = ELi««N»«(^) 20 


[0,*] 






sin(Y // cc2+cc|) 2 — 0.5 
-°- 5 + (l+0.001(x2 +:c 2))2 


[-100,100] 




S chaffer 





Easom 


fg = —COs(xi)cOs(x2)x 

exp( — (xi — 7r) — (x2 — tt)) 


[-100,100] 


-1 








/io = [1 + (xi + x 2 + 1) 2 (19 - Mxi + 3x? 






Goldstein-Price 


-14x2 + 6x1x2 + 3x|)] x [30 + (2xi - 3x 2 ) 2 
(18 - 32xi + 12x1 + 48x 2 - 36xiX2 + 27x|)] 


[-2,2] 


3 



4.2. Parameters setting. All of the algorithms were run on MATLAB (Version 
R2010b) software platform. For simplicity and normalization, the experiment spec- 
ifies all the control parameters of transformation operators in STAI and STAII 
starting at 1. Commonly, the variation of a parameter follows a linear, exponential 
or a logistic way. In this paper, the exponential way is accepted for its rapidity, 
of which the base is 2 in the experiment. In view of the operational precision of 
MATLAB in short format, the minimum a factor fixed at le-4 is enough for the 
situation. 

As for RCGA, we use the same parameter settings as in[26]. Then, for CLPSO 
and SaDE, we use the MATLAB codes provided by the author in[14, 20] with minor 
revisions for this experiment. 

Programs were run independently for 30 trails, and for each trail, the population 
scale is 30, and the maximum iteration is 1000. The detailed parameters of STAI 
and STAII are shown in Table 2 and Table 3, respectively. 

Table 2. Parameters setting of STAI 



Parameter 


Value 


SE 


30 


a 


1 le-4 




1 


7 


1 


S 


1 


fc 


2 



4.3. Results and discussion. For comparison, some common statistics are in- 
troduced. The best means the minimum of the results, the worst indicates the 
maximum of the results, and then it follows the mean, median and st. dev. (standard 
deviation). In some way, these statistics are able to evaluate the search ability and 
solution accuracy, reliability and convergence as well as stability. To be more spe- 
cific, the best indicates the global search ability and solution accuracy, the worst 
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Table 3. Parameters setting of STAII 



Parameter 


Value 


SN 


30 


SE 


10 


CF 


50 


a 


1 le-4 


(3 


1 


7 


1 


S 


1 


fc 


2 



and the mean signify the reliability and convergence, while the median and st.dev. 
correspond to the stability. 

Results for two dimensional functions optimization are listed in Table 4, while 
results for ten dimensional functions optimization can be found in Table 5. On the 
other hand, illustrations of the average fitness in 30 simulations are given in Figure 
3 and Figure 4 for two dimensional and ten dimensional functions, respectively. The 
average fitness curve can visually depict the search ability and convergence perfor- 
mance. In the following paragraphs the analysis of the results for each functions 
will be discussed separately. 

Spherical Function: as can be seen from the results, all of the algorithms can 
find the global optimum with high solution precision and have good reliability as 
well as stability for this function in terms of two and ten dimensions. But the STAI 
and STAII are able to search much deeper than other three algorithms, which can 
also be observed in subfigure (A) of Figure 3 and Figure 4. In the subfigure (A), we 
can see that STAs can converge much faster than the remained methods. While for 
STAI and STAII, it is found that STAI has a little faster convergence performance 
than that of STAII. 

Rastrigin Function: we can see from the results that all of the algorithms 
can find the global optimum and have good reliability as well as stability in two 
dimension. For the ten dimensional problem, the global optimum can also be found 
by all algorithms; however, STAI and STAII have better statistical performances 
than other three algorithms especially described by the worst RCGA and SaDE 
can not achieve the best occasionally, and the mean of RCGA is not satisfactory. 
From subfigure (B) of Figure 3 and Figure 4, we can also find that STAs converge 
much faster than other algorithms, and higher solution precision can be obtained. 
In this time, the process of STAII is slightly faster than that of STAI. 

Griewank Function: from the results, we can find that most algorithms have 
the ability to achieve the best and have both reliability and stability in the two di- 
mensional function except the RCGA. While for the ten-dimension function, these 
methods are able to find the global optimum but the statistical performances are 
not satisfactory except STAII, the results of which are excellent. In subfigure (C) 
of Figure 3 and Figure 4, it can be found that STAII converge fastest and have 
highest solution precision of all. 

Rosenbrock Function: all of the algorithms have no problem to find the global 
optimum for this function in two-dimensional space, but the worst of RCGA indicate 
that it is not reliable and a bit deficient for the function. Regarding corresponding 
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Table 4. Comparisons among various algorithms on test functions (2D) 



Fen 


Statistic 


RCGA 


CLPSO 


SaDE 


STAI 


STAII 




best 


1.5795e-028 


2.7344e-091 


2.9229e-196 










median 


4.9090e-024 


2.6808e-087 


4.2548e-189 








h 


mean 


1.5458e-022 


5.9580e-082 


6.6729e-188 










worst 


2.5849e-021 


1.7771e-080 


9.6210e-187 










st. dev. 


4.7939e-022 


3.2440e-081 













best 



















median 

















h 


mean 



















worst 



















st. dev. 



















best 



















median 


0.0074 














h 


mean 


0.0042 


1.2460e-009 













worst 


0.0074 


3.7377e-008 













st. dev. 


0.0037 


6.8241e-009 













best 


7.0832e-008 


9.2890e-010 





1.0092e-012 


4.2592e-014 




median 


0.0085 


3.9984e-007 





1.0900e-011 


3.9400e-012 


h 


mean 


1.0364 


3.9260e-005 





1.2571e-011 


4.4217e-012 




worst 


26.2801 


6.3652e-004 





4.7764e-011 


1.4588e-011 




st. dev. 


4.7802 


1.4088e-004 





1.0692e-011 


3.9023e-012 




best 


-837.9658 


-837.9658 


-837.9658 


-837.9658 


-837.9658 




median 


-837.9658 


-837.9658 


-837.9658 


-837.9658 


-837.9658 


h 


mean 


-822.1740 


-837.9658 


-837.9658 


-837.9658 


-837.9658 




worst 


-719.5274 


-837.9658 


-837.9658 


-837.9658 


-837.9658 




st. dev. 


40.9496 








1.5939e-013 


1.0970e-013 




best 


2.0428e-014 


-8.8818e-016 


-8.8818e-016 


-8.8818e-016 


-8.8818e-016 




median 


3.1681e-012 


-8.8818e-016 


-8.8818e-016 


-8.8818e-016 


-8.8818e-016 


h 


mean 


4.7516e-011 


-8.8818e-016 


-8.8818e-016 


-8.8818e-016 


-8.8818e-016 




worst 


9.6937e-010 


-8.8818e-016 


-8.8818e-016 


-8.8818e-016 


-8.8818e-016 




st. dev. 


1.8319e-010 
















best 


-1.8013 


-1.8013 


-1.8013 


-1.8013 


-1.8013 




median 


-1.8013 


-1.8013 


-1.8013 


-1.8013 


-1.8013 


h 


mean 


-1.8013 


-1.8013 


-1.8013 


-1.8013 


-1.8013 




worst 


-1.8013 


-1.8013 


-1.8013 


-1.8013 


-1.8013 




st. dev. 


9.0336e-016 


9.0336e-016 


9.0336e-016 


2.2063e-011 


7.6618e-012 




best 



















median 


0.0097 


9.3299e-012 











h 


mean 


0.0071 


4.1836e-004 


6.4773e-004 










worst 


0.0097 


0.0097 


0.0097 










st. dev. 


0.0044 


0.0018 


0.0025 










best 


-1.0000 


-1.0000 


-1.0000 


-1.0000 


-1.0000 




median 


-1.0000 


-1.0000 


-1.0000 


-1.0000 


-1.0000 


h 


mean 


-1.0000 


-1.0000 


-1.0000 


-1.0000 


-1.0000 




worst 


-1.0000 


-1.0000 


-1.0000 


-1.0000 


-1.0000 




st. dev. 











1.0124e-012 


2.6511e-013 




best 


3.0000 


3.0000 


3.0000 


3.0000 


3.0000 




median 


3.0000 


3.0000 


3.0000 


3.0000 


3.0000 


ho 


mean 


3.0000 


3.0000 


3.0000 


3.0000 


3.0000 




worst 


3.0000 


3.0000 


3.0000 


3.0000 


3.0000 




st.dev. 


2.5135e-015 


1.5317e-015 


1.2669e-015 


2.5697e-010 


1.2191e-010 
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Figure 3. Average fitness of the two-dimensional functions from 
fi to /io 
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Table 5. Comparisons among various algorithms on test functions (10D) 



Fen 


Statistic 


RCGA 


CLPSO 


SaDE 


STAI 


STAII 




best 


4.0118e-012 


1.5229e-012 


1.0686e-053 










median 


4.1019e-011 


5.3464e-011 


1.6873e-051 








fl 


mean 


8.4302e-011 


6.0282e-011 


3.5549e-051 










worst 


5.6332e-010 


1.6279e-010 


2.7214e-050 










st. dev. 


1.1766e-010 


4.9198e-011 


6.1159e-051 










best 


1.0814e-011 


1.9806e-006 













median 


2.9849 


1.0401e-005 











h 


mean 


2.6864 


2.7769e-005 


0.0332 










worst 


5.9697 


2.5009e-004 


0.9950 










st. dev. 


1.5491 


4.7363e-005 


0.1817 










best 


3.2914e-010 


2.6061e-005 













median 


0.0492 


0.0019 











h 


mean 


0.0582 


0.0038 


5.7529e-004 


0.0166 







worst 


0.1699 


0.0166 


0.0099 


0.0738 







st. dev. 


0.0439 


0.0045 


0.0022 


0.0260 







best 


0.0662 


0.6841 


8.3515e-012 


2.6607e-005 


7.3949e-005 




median 


7.2483 


4.0404 


3.4637e-004 


1.7823 


0.2809 


U 


mean 


7.0110 


4.3775 


0.3415 


2.3266 


0.4095 




worst 


9.2754 


12.3253 


3.9866 


21.8603 


1.5228 




st. dev. 


1.4466 


2.8042 


1.0098 


3.7249 


0.4124 




best 


-3.9530e+003 


-4.1898e+003 


-4.1898e+003 


-4.1898e+003 


-4.1898e+003 




median 


-3.8345e+003 


-4.1898e+003 


-4.1898e+003 


-4.1898e+003 


-4.1898e+003 


h 


mean 


-3.7832e+003 


-4.1898e+003 


-4.1898e+003 


-4.1898e+003 


-4.1898e+003 




worst 


-3.3608e+003 


-4.1898e+003 


-4.1898e+003 


-4.1898e+003 


-4.1898e+003 




st. dev. 


151.3665 


5.5006e-008 


2.7751e-012 


1.1734e-011 


1.9256e-012 




best 


6.6339e-007 


1.5533e-006 


-8.8818e-016 


-8.8818e-016 


-8.8818e-016 




median 


2.8352e-006 


4.0369e-006 


2.6645e-015 


-8.8818e-016 


-8.8818e-016 


h 


mean 


3.1524e-006 


4.8197e-006 


2.3093e-015 


2.9606e-016 


-8.8818e-016 




worst 


9.6980e-006 


1.3052e-005 


2.6645e-015 


2.6645e-015 


-8.8818e-016 




st. dev. 


2.1010e-006 


3.2984e-006 


1.0840e-015 


1.7034e-015 







best 


-9.6154 


-9.6601 


-9.6602 


-9.6602 


-9.6602 




median 


-9.2604 


-9.6598 


-9.6602 


-9.6602 


-9.6602 


h 


mean 


-9.2425 


-9.6588 


-9.6513 


-9.1797 


-9.6602 




worst 


-8.7143 


-9.6549 


-9.6135 


-7.6602 


-9.6602 




st. dev. 


0.2265 


0.0018 


0.0173 


0.6044 


1.9138e-009 



ten dimensional problem, only SaDE and STAs can find the best with a low proba- 
bility. In this case, SaDE achieves best results, followed by STAII. From subfigure 
(D) of Figure 3 and Figure 4, we can find that STAs still converge faster than other 
algorithm but with not higher solution precision than SaDE. Compared with STAI, 
STAII have much better statistical performances, which are indicated by the worst 
and the mean. 

Schewefel Function: as for the function, only RCGA can not find the global 
optimum for both two and ten dimensions; furthermore, the median and st.dev. 
also show that the RCGA is not stable and reliable for this function. Other algo- 
rithms achieve the best as well as good reliability and stability because the st.dev 
approaches zero for these methods. From subfigure (E) of Figure 3 and Figure 4, 
the faster convergence speed belongs to the STAs as well. While for STAs, it shows 
that STAII converge faster than STAI for the function. 

Ackley Function: it seems that all of the algorithms have no problem in finding 
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(g) fr 

Figure 4. Average fitness of the ten-dimensional functions from f\ to fa 

the global optimum for the function in terms of two and ten dimensions. The sta- 
tistical performances of results are satisfactory for all methods because the st.dev. 
approaches zero. In subfigure (F) of Figure 3 and Figure 4, we can find that STAs 
also have faster convergence speed than other algorithms and the solution precision 
is also higher for STAs when compared with others. 

Michalewicz Function: all of the algorithms can achieve the global optimum 
for this function in two dimension, and the statistical performances is satisfactory 
in this case. While for the function with ten dimension, only STAII are able to 
achieve the same statistics as the results in two dimensional function. More specif- 
ically, RCGA and CLPSO are not able to find the best. The worst of STAI show 
that it is not reliable sometimes. 
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Schaffer Function: the global optimum can be found by all the algorithms; 
however, only STAs can achieve reliable and stable performance for this function, 
as indicated by the mean and the st.dev.. Other methods fail to find the best oc- 
casionally, which is described by the worst, that is to say, other algorithm are not 
reliable for the function. In the case, STAII converge much faster than STAI, as 
illustrated by subfigure (H). 

Easom Function: all of the algorithms are able to find the global optimum 
with a high probability. The st.dev. indicates that the statistical performance are 
also fine for all methods. The subfigure (I) of Figure 3 shows that STAs have better 
convergence performance again. 

Goldstein-Price Function: as described in Table 4, global optimum can be 
found by all algorithms, the results of which are satisfactory because the st.dev. 
approaches zero. The subfigure (J) of Figure 3 shows that the convergence speed is 
fine for all methods but the STAs are much better to some extent. 

Over all, some explanations can be given on the behavior of average fitness curves. 
As shown in Figure 3 and Figure 4, the curves of STAs change steadily during the 
iteration process in most cases, There are two reasons that account for the phenom- 
enon. Firstly, the rotation guarantees the steady decrease of the curves because the 
rotation factor changes from a maximum value to a minimum value in a periodical 
way, which prevents current best state from changing sharply. If other transforma- 
tions do not work, then rotation will help searching in depth with a high precision. 
Secondly, expansion and translation are beneficial for searching in a new area, while 
the axesion is proposed to strength the single dimensional search, which are all ad- 
vantageous for the decrease of the curves. 

But every once in a while, especially described by the average fitness of f± and fj 
in ten dimension, STAs fail to find the global optimum. As declared in Part 2.2, rota- 
tion transformation is used for local search, expansion, translation, and axesion are 
helpful for global search. In current STAs, their control parameters (transformation 
factors) are determined by experimental experience for simplicity. The failure of 
STAs for /4 and fa occasionally indicate that the global search transformations 
need to be deeply studied. Anyway, the smaller rotation factor will facilitate the 
exploitation and the bigger expansion, translation and axesion factors will benefit 
the exploration, though how to balance them are still pending. Regarding the in- 
fluence of the CF, we can find that STAII has stronger search ability than STAI 
as the introducing of intermittent exchange. As illustrated by the average fitness 
curves, the fitness by STAII can still decrease even if that of STAI is already steady, 
that is to say, the communication strategy can help share information and prevent 
premature convergence. If the CF is larger, more information will be shared, and 
if the CF is small, self development will be enhanced. 

By the way, the searching time required for STAs is infinity in theory, which 
is the consequence of random search methodology. However, in practice, we can 
stop the iteration process by presetting some criteria, for example, the prescribed 
maximum iterations, or when the fitness is unchanged for a number of times. In 
this paper, the maximum iterations is used. 

5. Conclusion. Based on state and state transition, the STA, not only has a simple 
form but also possess clear geometric significance, which is easy for understanding. 
Concerning the continuous function optimization problems, it presents the state 
transformations including rotation transformation, translation transformation and 
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expansion transformation as well as axesion transformation. The paper focuses on 
the unconstrained optimization problems, and it studies mainly on the approaches 
of transformations. Furthermore, to enhance its performance in high dimensional 
functions optimization, communication strategy has been introduced, and the in- 
termittent exchange is proposed to strength the search ability as well as prevent 
premature convergence. Using 10 benchmark functions for testing, compared with 
some distinguished optimization algorithms, it shows that STAs have fine perfor- 
mance in terms of global search ability and convergence accuracy, which confirms 
the effectiveness of the proposed algorithms. 

On the other hand, distinguished from other population-based algorithms, STA 
is not originated from simulating natural intelligence, but it takes advantages of 
the space structure of a function, which opens a new window for optimization. In 
the paper, control parameters of STAs are not studied deeply, and they are only 
determined by the experimental experience or for simplicity. In our future work, 
these problems will be focused on to better develop the state transition algorithms. 

Acknowledgments. We would like to thank the anonymous referees for their valu- 
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